Predicting perceived ethnicity with data on personal names in Russia
نویسندگان
چکیده
Abstract In this paper, we develop a machine learning classifier that predicts perceived ethnicity from data on personal names for major ethnic groups populating Russia. We collect VK, the largest Russian social media website. Ethnicity was coded languages spoken by users and their geographical location, with manually cleaned crowd workers. The shows accuracy of 0.82 scheme 24 0.92 15 aggregated groups. It can be used research relations in Russia, sets have but not ethnicity.
منابع مشابه
An ontology of ethnicity based upon personal names with implications for neighbourhood profiling
Understanding of the nature and detailed composition of ethnic groups remains key to a vast swathe of social science and human natural science. Yet ethnic origin is not easy to define, much less measure, and ascribing ethnic origins is one of the most contested and unstable research concepts of the last decade not only in the social sciences, but also in human biology and medicine. As a result,...
متن کاملPredicting coping self-efficacy based on social support, personal growth, and mindfulness in people with cancer
One of the most significant and complex health issues in our country is cancer. Coping with psychological syptoms of cancer such as stress, anxiety and depression is the most challenging blind spot for patients suffering from the cancer. The purpose of this study is to predict coping self-efficacy based on social support, personal growth and mindfulness in people with cancer. 120 participants w...
متن کاملPersonal Names in Modern Turkey
We analyzed the most common 5000 male and 5000 female Turkish names based on their etymological, morphological, and semantic attributes. The name statistics are based on all Turkish citizens who were alive in 2014 and they cover 90% of all population. To the best of our knowledge, this study is the most comprehensive data-driven analysis of Turkish personal name inventory. Female names have a g...
متن کاملPredicting Webcasting Adoption via Personal Innovativeness and Perceived Utilities
CAROLYN A. LIN University of Connecticut [email protected] Broadcasting over the internet presents a new frontier for media and advertising industries to conquer. At the local level, the greatest asset of a television station is its "localism"—with the audience still regarding television stations as the most effective source for local weather, traffic, and sports news as well as advertising...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of computational social science
سال: 2023
ISSN: ['2432-2725', '2432-2717']
DOI: https://doi.org/10.1007/s42001-023-00205-y